Approximate Sampling Formulae for General Finite-alleles Models of Mutation
نویسندگان
چکیده
Many applications in genetic analyses utilize sampling distributions, which describe the probability of observing a sample of DNA sequences randomly drawn from a population. In the one-locus case with special models of mutation, such as the infinite-alleles model or the finite-alleles parent-independent mutation model, closed-form sampling distributions under the coalescent have been known for many decades. However, no exact formula is currently known for more general models of mutation that are of biological interest. In this paper, models with finitely-many alleles are considered, and an urn construction related to the coalescent is used to derive approximate closed-form sampling formulae for an arbitrary irreducible recurrent mutation model or for a reversible recurrent mutation model, depending on whether the number of distinct observed allele types is at most three or four, respectively. It is demonstrated empirically that the formulae derived here are highly accurate when the per-base mutation rate is low, which holds for many biological organisms.
منابع مشابه
Approximate Sampling Formulas for General Finite-alleles Models of Mutation.
Many applications in genetic analyses utilize sampling distributions, which describe the probability of observing a sample of DNA sequences randomly drawn from a population. In the one-locus case with special models of mutation such as the infinite-alleles model or the finite-alleles parent-independent mutation model, closed-form sampling distributions under the coalescent have been known for m...
متن کاملApproximate Closed-form Formulae for Buckling Analysis of Rectangular Tubes under Torsion
The buckling torque may be much less than the yield torque in very thin rectangular tubes under torsion. In this paper, simple closed-form formulae are presented for buckling analysis of long hollow rectangular tubes under torsion. By the presented formulae, one can obtain the critical torque or the critical angle of twist of the tube in terms of its geometrical parameters and material constant...
متن کاملGenealogies of regular exchangeable coalescents with applications to sampling
This article considers a model of genealogy corresponding to a regular exchangeable coalescent (also known as Ξ-coalescent) started from a large finite configuration, and undergoing neutral mutations. Asymptotic expressions for the number of active lineages were obtained by the author in a previous work. Analogous results for the number of active mutationfree lineages and the combined lineage l...
متن کاملRandom evolutionary dynamics driven by fitness and house-of-cards mutations. Sampling formulae
We first revisit the multi-allelic mutation-fitness balance problem, especially when mutations obey a house of cards condition, where the discrete-time deterministic evolutionary dynamics of the allelic frequencies derives from a Shahshahani potential. We then consider multi-allelic WrightFisher stochastic models whose deviation to neutrality is from the Shahshahani mutation/selection potential...
متن کاملA principled approach to deriving approximate conditional sampling distributions in population genetics models with recombination.
The multilocus conditional sampling distribution (CSD) describes the probability that an additionally sampled DNA sequence is of a certain type, given that a collection of sequences has already been observed. The CSD has a wide range of applications in both computational biology and population genomics analysis, including phasing genotype data into haplotype data, imputing missing data, estimat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012